Search CORE

108 research outputs found

More is Less, Less is More: Molecular-Scale Photonic NoC Power Topologies

Author: Alvin R Lebeck
Chris Dwyer
Jun Pang
Publication venue
Publication date: 24/04/2020
Field of study

Abstract Molecular-scale Network-on-Chip (mNoC) crossbars use quantum dot LEDs as an on-chip light source, and chromophores to provide optical signal filtering for receivers. An mNoC reduces power consumption or enables scaling to larger crossbars for a reduced energy budget compared to current nanophotonic NoC crossbars. Since communication latency is reduced by using a high-radix crossbar, minimizing power consumption becomes a primary design target. Conventional Single Writer Multiple Reader (SWMR) photonic crossbar designs broadcast all packets, and incur the commensurate required power, even if only two nodes are communicating. This paper introduces power topologies, enabled by unique capabilities of mNoC technology, to reduce overall interconnect power consumption. A power topology corresponds to the logical connectivity provided by a given power mode. Broadcast is one power mode and it consumes the maximum power. Additional power modes consume less power but allow a source to communicate with only a statically defined, potentially non-contiguous, subset of nodes. Overall interconnect power is reduced if the more frequently communicating nodes use modes that consume less power, while less frequently communicating nodes use modes that consume more power. We also investigate thread mapping techniques to fully exploit power topologies. We explore various mNoC power topologies with one, two and four power modes for a radix-256 SWMR mNoC crossbar. Our results show that the combination of power topologies and intelligent thread mapping can reduce total mNoC power by up Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. to 51% on average for a set of 12 SPLASH benchmarks. Furthermore performance is 10% better than conventional resonator-based photonic NoCs and energy is reduced by 72%

CiteSeerX

Mechanisms for cooperative shared memory

Author: Chandra Satish
Falsafi Babak
Hill Mark D.
Larus James R.
Lebeck Alvin R.
Lewis James C.
Mukherjee Shubhendu S.
Palacharla Subbarao
Reinhardt Steven K.
Wood David A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 06/04/2009
Field of study

This paper explores the complexity of implementing directory protocols by examining their mechanisms - primitive operations on directories, caches, and network interfaces. We compare the following protocols: Dir1B, Dir4B, Dir4NB, DirnNB, Dir1SW and an improved version of Dir1SW (Dir1SW+). The comparison shows that the mechanisms and mechanism sequencing of Dir1SW and Dir1SW+ are simpler than those for other protocols. We also compare protocol performance by running eight benchmarks on 32 processor systems. Simulations show that Dir1SW+'s performance is comparable to more complex directory protocols. The significant disparity in hardware complexity and the small difference in performance argue that Dir1SW+ may be a more effective use of resources. The small performance difference is attributable to two factors: the low degree of sharing in the benchmarks and Check-In/Check-Out (CICO) directives

Infoscience - École polytechnique fédérale de Lausanne

Fine-grain access control for distributed shared memory

Author: Alvin R. Lebeck
Babak Falsafi
Cheriton David R.
Dally William J.
David A. Wood
Falsafi Babak
Ioannis Schoinas
James R. Larus
Nowatzyk A.
Steven K. Reinhardt
Uhlig Richard
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Modeling and simulation of a nanoscale optical computing system

Author: Achermann
Alvin R. Lebeck
Anni
Beetz
Biberman
Biswas
Cho
Christopher Dwyer
de Silva
Drobychev
Dwyer
Foresi
Glesener
Huang
Joshi
Jun Pang
Kim
Kirman
Klimov
Klimov
Klimov
Lakowicz
Ouyang
Petkova
Pistol
Pistol
Pistol
Vandeweyer
Vantrease
Wang
Wang
Zhang
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Cache conscious programming in undergraduate computer science

Author: Alvin R. Lebeck
Publication venue: ACM
Publication date: 01/01/1999
Field of study

performance potential of fast processors, programmers The wide-spread use of microprocessor based systems must explicitly consider cache behavior, restructuring their that utilize cache memory to alleviate excessively long codes to increase locality. DRAM access times introduces a new dimension in the As fast processors proliferate, techniques for improving quest to obtain good program performance. To fully cache performance must move beyond the supercomputer, exploit the performance potential of these fast processors, multiprocessor, and academic research communities and programmers must reason about their program’s cache into the mainstream of computing. To expedite this trans-performance. Heretofore, this topic has been restricted to fer of knowledge, as part of the CURIOUS (Center for the supercomputer, multiprocessor, and academic research Undergraduate education and Research: Integration community. It is now time to introduce this topic into thrOUgh performance and viSualization) project at Duke undergraduate computer science curriculum

CiteSeerX

Crossref